BBN's PLUM Probabilistic Language Understanding System

نویسنده

  • The PLUM System Group
چکیده

Three key design features distinguish PLUM from other approaches: statistical language modeling, learning algorithms and partial understanding. The first key feature is the use of statistical modeling to guide processing. For the version of PLUM used in MUC-5, part of speech information was determined by using well-known Markov modeling techniques embodied in BBN's part-of-speech tagger POST [5]. We also used a correction model, AMED [3], for improving Japanese segmentation and part-ofspeech tags assigned by JUMAN. For the microelectronics domain, we used a probabilistic model to help identify the role of a company in a capability (whether it is a developer, user, etc.). Statistical modeling in PLUM contributes to portability, robustness, and trainability. algorithms. We feel the key to portability of a data extraction system is automating the acquisition of the knowledge bases that need to change for a particular language or application. For the MUC-5 applications we used learning algorithms to train POST, AMED, and the template-filler model mentioned above. We also used a statistical learning algorithm to learn case frames for verbs from examples (the algorithm and empirical results are in [4]).

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

BBN's PLUM Probabilistic Language Understanding System

Three key design features distinguish PLUM from other approaches: statistical language modeling, learning algorithms and partial understanding. The first key feature is the use of statistical modeling to guide processing. For the version of PLUM used in MUC-5, part of speech information was determined by using well-known Markov modeling techniques embodied in BBN's part-of-speech tagger POST [5...

متن کامل

BBN: description of the PLUM system as used for MUC-3

Traditional approaches to the problem of extracting data from texts have emphasized handcrafted linguisti c knowledge. In contrast, BBN's PLUM system (Probabilistic Language Understanding Model) was developed as part of a DARPA-funded research effort on integrating probabilistic language models with more traditiona l linguistic techniques . Our research and development goals are • more rapid de...

متن کامل

BBN: description of the PLUM system as used for MUC-4

Traditional approaches to the problem of extracting data from texts have emphasized hand-crafted linguisti c knowledge . In contrast, BBN's PLUM system (Probabilistic Language Understanding Model) was developed a s part of a DARPA-funded research effort on integrating probabilistic language models with more traditional linguistic techniques . Our research and development goals are • more rapid ...

متن کامل

BBN: description of the PLUM system as used for MUC-5

APPROACH Traditional approaches to the problem of extracting data from texts have emphasized hand-crafted linguisti c knowledge. In contrast, BBN's PLUM system (Probabilistic Language Understanding Model) was developed as par t of an ARPA-funded research effort on integrating probabilistic language models with more traditional linguisti c techniques. Our research and development goals are : • m...

متن کامل

A New Approach to Text Understanding

This paper first briefly describes the architecture of PLUM, BBN's text processing system, and then reports on some experiments evaluating the effectiveness of the design at the component level. Three features are unusual in PLUM's architecture: a domainindependent deterrninistie parser, processing of (the resulting) fragments at the semantic and discourse level, and probabilistie models. 1. I ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1993